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Law of total probability and Bayes’ theorem in Riesz spaces 

Liang Hong 


Abstract. This note generalizes the notion of conditional probability to Riesz 
spaces using the order-theoretic approach. With the aid of this concept, we 
establish the law of total probability and Bayes’ theorem in Riesz spaces; we 
also prove an inclusion-exclusion formula in Riesz spaces. Several examples 
are provided to show that the law of total probability, Bayes’ theorem and 
inclusion-exclusion formula in probability theory are special cases of our re¬ 
sults. 


1. Introduction 

The study of stochastic processes in an abstract space can date back to as 
early as [ 2 ], Works along this line include [ 4 ], [ 5 ], [ 15 ], [ 26 ] and [ 27 ]. About 
a decade ago, [ 17 ] initiated the study of stochastic processes in a measure-free 
setting. They carefully investigated the fundamental properties of conditional ex¬ 
pectations on a probability space and abstract them to Riesz spaces; this allows one 
to generalize many concepts from probability theory to Riesz spaces. As a result, 
an order-theoretic approach to stochastic process can be formed to generalize the 
classical theory of stochastic processes. Since then this order-theoretic approach to 
stochastic processes has been developing fast. For order-theoretic investigation of 
conditional expectations and related concepts, we refer to [ 14 ], [ 15 ], [ 20 ], [ 31 ]; for 
discrete-time processes in Riesz spaces, we refer to [ 16 ], [ 17 ], [ 18 ], [ 19 ], [ 21 ], [ 22 ], 

[ 23 ] , [ 28 ]; for continuous-time processes in Riesz spaces, we refer to [ 6 ], [ 7 ], [ 8 ], [ 9 ], 
[ 10 ], [ 13 ], [ 29 ], [ 30 ]; for stochastic integrals in Riesz spaces, we refer to [ 11 ], [ 12 ], 

[ 24 ] . 

[ 22 ] extended the notation of independence to Riesz spaces; [ 9 ] gave a slightly 
more general definition of independence in Riesz spaces. In this note, we follow [ 22 ] 
to extend the concept of conditional probability to Riesz spaces and then establish 
the law of total probability and Bayes’ theorem in Riesz spaces; we also prove an 
inclusion-exclusion formula in Riesz spaces. Several examples are given to show 
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that the concept of conditional probability, law of total probability, Bayes’ theorem 
and inclusion-exclusion formula in probability theory are special cases of our results. 

The remainder of this note is organized as follows. Section 2 review some basic 
concepts and results of Riesz spaces and conditional expectations in Riesz spaces; 
for further details, we refer readers to [1], [3], [20], [25], [31] and [32]. Section 
3 generalizes the concept of conditional probability to Riesz spaces; we show that 
this notion leads naturally to the notion of independence in Riesz spaces. Section 4 
establishes the law of total probability, Bayes’ theorem and the inclusion-exclusion 
formula in Riesz spaces. 

2. Preliminaries of Riesz space theory 

A partially ordered set (X, is called a lattice if the infimum and supremum 
of any pair of elements in X exist. A real vector space X equipped with a partial 
order ^ is called an ordered vector space if its vector space structure is compatible 
with the order structure in a manner such that 

(a) if x ^ y, then x + z ^ y + z for any z £ X; 

(b) if x ^ y, then ax ^ ay for all a ^ 0. 

An ordered vector space is called a Riesz space (or a vector lattice) if it is also a 
lattice at the same time. For any pair x, y in a Riesz space, iVy denotes and 
supremum of {x, y}, x Ay denotes the infimum of {x, y}, and |x| denotes x V (—x). 
A Riesz space X is said to be Dedekind complete if every nonempty subset of X 
that is bounded from above has a supremum. A vector subspace of a Riesz space 
is said to be a Riesz subspace if it is closed under the lattice operation V. A subset 
Y of a Riesz space X is said to be solid if |x| < \y\ and y £ Y imply that x £ Y. 

A solid vector subspace of a Riesz space is called an ideal. A net ( x a ) ae A is said 

to be decreasing if a ^ [5 implies x a ^ xp. The notation x a j. x means (x a ) a eA is 
a decreasing net and the infimum of the set {x a \ a £ A} is x. A net (x a ) a ^A in 
a Riesz space X is said to be order-convergent to an element x £ X, often written 
as x a A- x, if there exists another net ( y a )aeA in X such that |x a — x| ^ y a i 0. 
A subset A of a Riesz space X is said to be order-closed if a net {x a } in A order- 
converges to Xo £ X implies that xo £ A. An order-closed ideal is called a band. 
Two elements x and y in a Riesz space X is said to be disjoint if |x| A |j/| = 0 and 

is denoted by xT y. The disjoint complement of a subset A of a Riesz space X is 

defined as A d = {x £ X \ x A y for all y £ A}. A band B in a Riesz space is said to 
be a projection band if X equals the direct sum of B and B d , that is, X = B © B d . 
Every band in a Dedekind complete Riesz space is a projection band. If B is a 
projection band of a Riesz space X and x £ X, then x = xi + X 2 , where xi £ B 
and X 2 £ B d . In this case, the map Pb : X -A X defined by Pb{x) = x\ is called 
the band projection associated with B. 

An additive group G, equipped with a partial order |j is called a partially 
ordered group if its group operation is compatible with the order structure in a 
manner such that 

(a) if x ^ y, then x + 2 ^ y + z for any z £ G; 

(b) if x ^ y, then z + x ^ z + y for any z £ G. 
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A partially ordered group is said to be a lattice-ordered group if it is also a lattice 
at the same time. Lattice-ordered rings and lattice-ordered fields are defined in a 
similar manner. Specifically, a ring R equipped with a partial order ^ is called a 
partially ordered ring if its ring operations are compatible with the order structure 
in a manner such that 

(a) if x ^ y, then x + z ^ y + z for any z € R; 

(b) if x ^ y and z > 0, then xz ^ yz and zx ^ zy. 

A partially ordered ring is said to be a lattice-ordered ring if its additive group 
is a lattice-ordered group. Likewise, a field F equipped with a partial order ^ is 
called a partially ordered field if its field operations are compatible with the order 
structure in a manner such that 

(a) if x ^ y, then x + z ^ y + z for any z € F; 

(b) if x > y and 2 > 0, then xz > yz; 

(c) 1 > 0, where 1 is the identity element of F. 

A partially ordered field is said to be a lattice-ordered field if its additive group is 
a lattice-ordered group. 

A linear operator T on a vector space A is said to be a projection if T 2 = T. 
A linear operator T between two Riesz spaces X and Y is said to be positive if 
x € X and x fz 0 implies T(x) ^ 0; T is said to be strictly positive if x G X and 
x > 0 implies T(x) > 0; T is said to be order-continuous if x a A- 0 in A implies 
T(x a ) A 0 in F. A linear operator T on a vector space V is called an averaging 
operator if T(yT(x)) = T(y)T(x) for any pair x,y € V such that yT{x) £ V. 

Following [17] and [20], we say a positive order-continuous projection on a 
Riesz space X with a weak order unit e is a conditional expectation if the range 
R(T) of T is a Dedekind complete Riesz subspace of X and T(e) is a weak order 
unit of X for every weak order unit e of X. If T is a conditional expectation on a 
Riesz space A, then there is a weak order unit e such that T(e) = e. 


3. Conditional probability in Riesz spaces 

To extend the notion of conditional probability in probability theory to Riesz 
spaces, we first recall the following definition. 


Definition 3.1. Let (D, A, P) be a probability space. Suppose A and B be two 
events such that P{B) > 0. Then the conditional probability P(A \ B) of A given 
B is defined as 


P(A | B) 


p{A n b) 

~P(B) 


Intuitively, P(A \ B ) measures the relative size of the projection of A onto B. 
This motivates the following definition. 


Definition 3.2. Let E be a Dedekind complete lattice-ordered field with a con¬ 
ditional expectation T and a weak unit e such that T(e) = e. Suppose B 1 and B 2 
are two projection bands, Pb 1 and Pb 2 are the corresponding band projections of 
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B\ and B 2 , respectively, and B\ n £?2 7 ^ <t>- Then the conditional probability Pb 1 \b 2 
of Bi given B 2 with respect to T is defined as 

(3.1) P(B 1 I B 2 ){f) = [TPb 2 P B \ (/)] [TPb 2 (/)] — 1 ; V/ e E. 

provided TPs 2 (f ) 7 ^ 0 . 

Remark. We require £ to be a lattice-ordered field because we need the multi¬ 
plication operation on E to be invertible. It is evident that if P Bl = Pb 2 then 
-Pbi|b 2 = !■> where I is the identity operator on E. Since two band projections on 
a Riesz space are commutative, Equation (3.1) may be replaced by 

P(B 1 I B 2 ){f) = [TP Bi PbM)} [TPb 2 U)\~\ V/ e E, 
provided TPs 2 (f) 7 ^ 0 . 

The following example shows that the classical definition of conditional prob¬ 
ability is a special case of Definition 2.1. 

Example 2.1. Let E be the family of all random variables with finite mean on 
a probability space (12, T, P), that is, E = L 1 (12, J 7 , P). Then e = In, where 1a 
denotes the indicator function of an event A. Let T be the expectation operator. 
For two events A, B £ T with B ^ (j>, the bands generated by 1a and 1 b are 
Bi = L'fljJnd,?) and B 2 = L 1 (B, E PI B, P), respectively. Therefore, the 
corresponding band projections Pb 1 and Pb 2 are defined as P Bl (f) = fl a and 
Pb 2 (/) = /Is- We have 

TP B2 (e) =E[l a l B ] =P(B), 

and 

TP B2 P Bl (e) = E[l n l B l A \ = P(An B). 

It follows that P{B\ | B 2 ) = P{A \ B) which is consistent with the classical defini¬ 
tion of conditional probability of A given B. 

Recall that in probability theory two events A and B are defined to be indepen¬ 
dent if P{A | B) does not depend on B and P{B \ A) does not depend on A , that 
is P(A | B) = P(A) and P(B \ A) = P(B). To accommodate the case P(B) = 0, 
we usually define two events A and B to be independent if P{AC\B) = P(A)P(B). 
Thus, Definition 3.2 leads naturally to the following concept of independence in 
Riesz spaces. 

Definition 3.3. Let E be a Dedekind complete lattice-ordered ring with a con¬ 
ditional expectation T and a weak unit e such that T(e) = e and the multiplication 
operation coincide with the lattice operation A. Suppose B\ and B 2 are two pro¬ 
jection bands of E and P Bl and Pb 2 are the associated band projections of B\ 
and B 2 , respectively. We say B\ and B 2 are independent with respect to T if 
T Pb 2 Pb 1 (e) = T Pb 2 (e)T P Bl (e). 

Remark. In Definition 3.3, we assume that £ is a Dedekind complete lattice- 
ordered ring and its multiplication is the lattice operation A. Definition 4.1 in [22] 
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defines two projection bands Pb 1 and Pb 2 to be independent with respect to a 
conditional expectation T in a Dedekind complete Riesz space if 

TPb 1 TPb 2 (e) = TP Bi Pb 2 ( e) = TP B2 TP Bl (e). 

As [23] pointed out, the space of e bounded elements in E will admit a natural 
f-algebra structure if we set P(e)Q(e) = PQ{e ) for two band projections P and Q. 
In view of this, our definition and Definition 4.1 in [22] are consistent. 

4. Law of total probability and Bayes’ theorem in Riesz spaces 

In probability theory, the law of total probability and Bayes’ theorem are two 
fundamental theorems involving conditional probability. In this section, we will 
extend these two theorems to Riesz spaces. We will also give examples to show 
that the classical law of total probability and Bayes’ theorem are special cases of 
law of total probability and Bayes’ theorem in Riesz spaces. To motivate our study, 
we first recall the law of total probability in probability theory: 

Theorem 4.1. Suppose B i,..., B n are mutually disjoint events on a probability 
space (f1, J-, P) such that Q = Ui = iBi and P(Bi ) > 0 for each 1 ^ i ^ n. Then for 
any event A 

n 

P{A)=Y J P{A\B i )P{B i ). 

i= 1 

Since P(A \ Bi)P(Bf) = P(AnBi), the intuition of law of total probability can 
be described as follows: If an event A is the union of n mutually disjoint sub-events 
An Bi, then the projection onto A is equivalent to the sum of all projections onto 
A n Bf s, that is, 

n 

p(d) = ^P(An5,). 

i=l 

This intuition extends naturally to the abstract case of Riesz spaces as the next 
theorem shows. To this end, we first state and prove an inclusion-exclusion formula 
for projection bands; this result extends the classical inclusion-exclusion theorem 
for events to Riesz spaces. 


Lemma 4.1 (Inclusion-exclusion formula for projection bands). Let B \,..., B n 
be n projection bands in a Riesz space. Then B\ + ... + B n is a projection band and 


Pbx+...+b u 


-Psij r\Bi. 

i— 1 <22^^- 


+- + (- 1 ) fe 1 PB il n...r\Bi k 

+■•■ + ( — 1 )" 1 -f > Bin...nB„- 


(4.1) 
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PROOF. The fact that B\ + ... + B n is a projection band is trivial. We show 
Equation (4.1) by induction on n. The case n = 2 follows from Theorem 1.45 of 
[1]. Suppose Equation (4.1) holds for n — 1. Then the induction hypothesis implies 


PB 1 + ...+B n 


= PBi + ...+Bn-i + Pb„ 

+ 

Cq 

+B n . 

-i Pb 




n —i 

= E^- 

E 

Pb h ns i2 

+ ... 





2=1 

1^21 <22^21 







+(-l) fe - 

E 

p B H r\ 

..ns 

k + ■ 

•• + ( 

-1)” 

2 -PBin...ns„_i 


1^21 <...<2* 

^.n 






+ P B„ - -Pbi + ...+B„_ 

-, P B n 






n— i 

= E^- 

E 

p Bi 1 nB i2 

+ ... 





2=1 

1^21 <22^21 







+(-D fe - 

E 

p B ix n 

..ns 

* + ■ 

•■ + ( 

-l) n 



l<2l<...<2* 

^n 






+ P B n ~ 

- 22—1 

E^- 

E 

Pb 


+ ... 




L 2=1 

1^2i<22^ 

n 





+ (-l) fc - 

E 

PBi 

l n...nB ik _ 1 

+ ... 

+ (- 

-l)"^ 2 PB in ...nB 




n— 1 

= ^ PBi + PB n 

2=1 
n— 1 

~C52 P Bi)PB n - E P B H nB i2 

2 = 1 1^2i<22^72 

+... + 

(-l) fc "i E P Bil n...nB ik + (-l) 1 *- 1 E Pbh n.., 

l^ii <...<2fc^n 1^2i <...<2fc_i^n 

+... + 

+ (-l)" _1 P B in...nB n _if > B n 

n 

= ^2 P Si- E P Bi x nB i2 

2=1 1^2i<22^n 

+... + (-l) fc “ 1 E PB^n-.nB^ 

<...<2fe^n 

+•■• + ( — 1 )" 1 ^ > Bin...nB„- 


By mathematical induction, (4.1) holds for all positive integer n. 


□ 


Theorem 4.2 (Law of total probability in Riesz spaces). Let E be a Dedekind 
complete lattice-ordered field with a conditional expectation T and a weak unit e 
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such that T[e) = e. Suppose B \,..., B n are n mutually disjoint nonempty projection 
bands such that E = Bi © B 2 ® ... ® B n and PB t is the band projection onto Bi. 
Then for any projection band D and its associated band projection Pd 

n 

(4.2) TP D (f)=J2(PD\B i (f))(TPB i (f)), V/GB. 

i -1 

provided TP Bi (f) ^ 0 for all 1 ^ i ^ n. 


Proof. Definition 3.2 shows that the right-hand side of Equation (4.2) equals 


J2 T (PDP Bi ) 

i =1 


T 


'y~](PpnB i ) 

. 1=1 


Since Bfs are mutually disjoint, PnrBi D PonBj = 0 for i / j. It follows from 
Lemma 4.1 that 


T 


y^(PpnBi) 

_i =1 


T [( p D n(E? =1 Bi)] 


TP d . 


Therefore, the theorem is established. 


□ 


Theorem 4.1 and Definition 3.2 immediately imply the following Bayes’ Theo¬ 
rem in Riesz spaces. 


Corollary 4.1 (Bayes’ Theorem in Riezs spaces). Let E be a Dedekind com¬ 
plete lattice-ordered field with a conditional expectation T and a weak unit e such 
that Te = e. Suppose B \,..., B n are n mutually disjoint nonempty projection bands 
such that E = B\ © P 2 0 © B n and P Bi is the band projection onto Bi(l ^ ^ n). 

Then for any projection band D and any 1 ^ j ^ n 


P Bj \p{f) = [(PdmUMTPbM 


n 


-1 


E( P B| Bl (/))(rP Bi (/)) 


V/ e E, 


provided TP Bi (f) ^ 0 for all 1 ^ i ^ n. 


Example 3.1. Take E = L 1 (ST, P, P). Then e = 1 q. Suppose T is the expectation 
operator and P B fs are defined as P Bi (f) = 1 Bi f for / S E, where Bi,B 2 , -jBn 
are n disjoint nonempty events such that D = U™ =1 Pi. Also, for any event D 
define P D (f) = l D f for / G E. Then TP D {e) = P(D),TP Bi {e) = P(P*) and 
Pp\ Bi (e) = P{D | Bi ); thus, Theorem 4.2 specializes to Theorem 4.1. Moreover, we 
have P Bl+ ... +B „(e) = P(U'L 1 P i )>- p B i (e) = P(Pj), Ps 4 l nB i2 (e) = P(Pq nB, 2 ),...; 
hence Lemma 4.1 specializes to the classical inclusion-exclusion formula. Finally, 
in this case Corollary 4.1 yields 


P{Bj | D) = 


P{D | P,)P(P J ) 


EtiP(D\B i )P(B i y 

therefore, Corollary 4.1 specializes to the classical Bayes’ theorem. 
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